Model Selection

OCR-free Document Understanding

# OCR-free Document Understanding

Vietable Donut Docvqa Demo

A fine-tuned version of the Donut model for Vietnamese document question answering (table data)

Question Answering System

Transformers Other

mPLUG-DocOwl2 is an OCR-free multimodal large language model for multi-page document understanding, efficiently encoding document content via a high-resolution document compressor.

Safetensors English

Donut Base Finetuned Rvlcdip

Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder to process document images.

Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for image-to-text conversion

Donut is an OCR-free document understanding Transformer model composed of a visual encoder (Swin Transformer) and a text decoder (BART).

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase